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Abstract. We call a digraph h-semicomplete if each vertex of the di¬ 
graph has at most h non-neighbors, where a non-neighbor of a vertex v 
is a vertex such that there is no edge between u and v in either 

direction. This notion generalizes that of semicomplete digraphs which 
are O-semicomplete and tournaments which are semicomplete and have 
no anti-parallel pairs of edges. Our results in this paper are as follows. 
(1) We give an algorithm which, given an h-semicomplete digraph G on 
n vertices and a positive integer k, in (h + 2 k + 1 ) 2k n°^ time either 
constructs a path-decomposition of G of width at most k or concludes 
correctly that the pathwidth of G is larger than k. (2) We show that 
there is a function f(k, h) such that every h-semicomplete digraph of 
pathwidth at least f(k , h) has a semicomplete subgraph of pathwidth at 
least k. 

One consequence of these results is that the problem of deciding if a fixed 
digraph H is topologically contained in a given h-semicomplete digraph 
G admits a polynomial-time algorithm for fixed h. 


1 Introduction 

A tournament is a digraph obtained from a complete graph by orienting each 
edge. A semicomplete digraph generalizes a tournament, allowing each pair of 
distinct vertices to optionally have two edges in both directions between them. 
Tournaments and semicomplete digraphs are well-studied (see for example) 
and have recently been attracting renewed interests in the following context. 

There are many problems on undirected graphs that admit polynomial time 
algorithms but have digraph counterparts that are NP-complete. For example, 
Robertson and Seymour [IS) , in their Graph Minors project, proved that the k 
disjoint paths problem (and the k edge-disjoint paths problem) can be solved in 
polynomial for fixed k. On the other hand, digraph versions of these problems are 
NP-complete even for k = 2 due to Fortune, Hopcroft, and Wyllie [8|. Recently, 
Chudnovsky, Scot, and Seymour [5] showed that the k directed disjoint paths 
problem can be solved in polynomial time for fixed k if the digraph is restricted 


to be semicomplete. The edge-disjoint version of the problem is also polynomial 
time solvable on semicomplete digraphs, due to Fradkin and Seymour HU. The 
situation is similar for the topological containment problem, which asks if a 
given graph (digraph) contains a subgraph isomorphic to a subdivision of a 
fixed graph (digraph) H: the undirected version is polynomial time solvable 
due to the disjoint paths result and the directed version is NP-complete on 
general digraphs [8], while the question on semicomplete digraphs is polynomial 
time solvable due to Fradkin and Seymour m and moreover is fixed-parameter 
tractable due to Fomin and Pilipczuk mm- In addition to these algorithmic 
results, some well-quasi-order results that are similar to the celebrated Graph 
Minors theorem of Robertson and Seymour |19j have been proved on the class of 
semicomplete digraphs EE3- These developments seem to suggest that the class 
of semicomplete digraphs is a promising stage for pursuing digraph analogues of 
the splendid outcomes, direct and indirect, from the Graph Minors project. 

Given this progress on semicomplete digraphs, it is natural to look for more 
general classes of digraphs on which similar results hold. Indeed, the results 
on disjoint paths problems cited above are proved for some generalizations of 
semicomplete digraphs. The vertex-disjoint path algorithm given in [5] works for 
a digraph class called d-path dominant digraphs, which contains semicomplete 
digraphs ( d = 1) and digraphs with multipartite underlying graphs (d= 2). The 
edge-disjoint path algorithm given in m works for digraphs with independence 
number (of the underlying graph) bounded by some fixed integer. On the other 
hand, the results for topological containment in psisni] are strictly for the class 
of semicomplete graphs. 

The pathwidth of digraphs, which plays an essential role in some of the above 
results, is defined as follows. Let G be a digraph. A path-decomposition of G is 
a sequence (X\,... ,X m ) of vertex sets X,; C V(G), called bags, such that the 
following three conditions are satisfied: 

1- Ul <i<m X i = V(G), 

2. for each edge (u, v ) of G, u £ X, and v £ Xj for some i > j, and 

3. for every v € V(G), the set {i \ v £ Xi} of indices of the bags containing v 
forms a single integer interval. 

The first and the third conditions are the same as in the definition of the path- 
width of undirected graphs; the second condition, on each edge, is different and 
depends on the direction of the edge. Note that some authors, including the 
present authors in previous work in different contexts, reverse the direction of 
edges in this condition. We follow the convention of the papers cited above. As in 
the case of undirected graphs, the width of a path-decomposition (Ai,... ,X m ) 
is maxi<j< m \Xi\ — 1 and the pathwidth of G, denoted by pw(G), is the smallest 
integer k such that there is a path-decomposition of G of width k. 

Unlike for the pathwidth of undirected graphs, which is linear-time fixed- 
parameter tractable [3], no FPT-time algorithm is known for computing the 
pathwidth of general digraphs: only XP-time algorithms (of running time n°®) 
are known. The third author of the current paper proposed one in |14j . which 
was unfortunately flawed and has recently been corrected in l!2l by the current 
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and two more authors. Another XP algorithm is due to Nagamochi m, which is 
formulated for a more general problem of optimizing linear layouts in submodular 
systems. 

In this paper, we consider another direction of generalizing semicomplete 
digraphs and study the pathwidth of digraphs in the generalized class. For non¬ 
negative integer h, we say that a simple digraph G is /i-semicomplete if each 
vertex of G has at most h non-neighbors, where a non-neighbor of vertex v is 
a vertex u distinct from v such that there is no edge of G between u and v 
in either direction. Thus, semicomplete digraphs are O-semicomplete. Our main 
results are as follows. 

Theorem 1. There is an algorithm which, given an h-semicomplete digraph 
G on n vertices and a positive integer k, in (h + 2k + 1 ) 2k n°^ 1 ' 1 time either 
constructs a path-decomposition of G of width at most k or concludes correctly 
that the pathwidth is larger than k. 

This theorem generalizes the k°^n 2 time result of Pilipczuk El on semicom¬ 
plete digraphs. Compared on semicomplete digraphs, his algorithm has smaller 
dependence on n (our 0(1) exponent on n is naively 4), while the hidden constant 
in the exponent on k can be large. 

Theorem 2. There is a function f{h , k) on positive integers h and k such that 
each h-semicomplete digraph with pathwidth at least f(h , k) has a semicomplete 
subgraph of pathwidth at least k. 

The topological containment result in |T0] is based on two components. One is 
a combinatorial result that, for each fixed digraph if, there is a positive integer k 
such that every semicomplete digraph G of pathwidth larger than k topologically 
contains H. The second component is a dynamic programming algorithm that, 
given a digraph G on n vertices together with a path-decomposition of width k 
and a digraph H on r vertices with s edges, decides if G topologically contains 
H in 0(n 3 ( k+rs ' >+4 ) time. Note that this algorithm does not require G to be 
semicomplete. Theorem [2] enables us to generalize the first component to h- 
semicomplete digraphs and Theorem [T] gives us the path-decomposition to be 
used in the dynamic programming. Thus, we have the following theorem. 

Theorem 3. For fixed positive integer h and fixed digraph H, the problem of 
deciding if a given h-semicomplete digraph topologically contains H can be solved 
in polynomial time. 

We should remark that extending the FPT result of uni in this direction 
using the approach of this paper appears difficult, as the FPT-time dynamic 
programming algorithm therein heavily relies on the strict semicompleteness of 
the input digraph. 


Techniques Our algorithm in Theorem |T] borrows the notion of separation 
chains from Hzj but the algorithm itself is completely different from the one 
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in m- The advantage of our algorithm is that it works correctly on general di¬ 
graphs, in contrast to the one in m which is highly specialized for semicomplete 
digraphs. We need a property of /i-semicomplete digraphs only in the analysis 
of the running time. 

Our algorithm is based on the one due to Nagamochi [ 15] for more general 
problem of finding an optimal linear layout for submodular systems. Informally, 
his algorithm applied to the pathwidth computation works as follows. Fix di¬ 
graph G and let d + (U) for each U C V(G) denote the number of out-neighbors 
of U. The width of permutation n of V ( G ) is defined to be the maximum of 
d + (V ( 7 r')) where 7 r' ranges over all the prefixes of 7r and V ( 7 r') denotes the set of 
vertices in n'. The smallest integer k such that there is a permutation of width 
k is called the vertex separation number of G and is equal to the pathwidth of 
G [20] . Thus, our goal is to decide, given k. if there is a permutation of V(G) of 
width at most k. 

Nagamochi’s algorithm is a combination of divide-and-conquer and branching 
from both sides of the permutation. For disjoint subsets S and T of V(G), call a 
permutation 7r of V (G) an (S, T)-permutation, if it has a prefix id with V ( 7 r 7 ) = S 
and a suffix n" with V(ir") = T. A vertex set X that minimize d + (X) subject to 
S C X C V(G) \T is called a minimum (S,T)-separator. A crucial observation, 
based on the submodularity of set function d + is the following. Let X be a 
minimum (S, T)-separator. Then, if there is an (S, Tj-permutation of width at 
most k then there is such a permutation that is an (S, V (G) \ A)-permutation 
and an ( X , Tj-permutation at the same time. Thus if there is a minimum (S, T)- 
separator distinct from both S and V (G) \ T, then we can divide the problem 
into two smaller subproblems. When there is no minimum (S, T)-separator other 
than S or V (G) \ T, we need to branch on vertices to add to S or T. For general 
digraphs, the running time is n 2k +°( 1 )- we need to branch on 0(n) vertices from 
both sides, and the depth of branching is bounded by k, as the value d + (X) of 
the minimum separator X increases at least by one after we branch from both 
sides. 

For h-semicomplete digraphs, we observe that the number of vertices v such 
that d + (S U {u}) < k is at most h + 2k + 1 (see Proposition [T]) and therefore, 
we need to branch on at most h + 2k + 1 vertices when extending from S. 
Unfortunately, we do not have a similar bound on the number of vertices to 
branch on from the side of T. For example, if|Tj < k, then d + (V r (G)\(TU{u})) < 
k for every v qL T and therefore we need to branch on every vertex not in 
TU5U N + (S ), where N + (S) denotes the set of out-neighbors of S. 

This asymmetry comes from the asymmetry inherent in the vertex separation 
number characterization: the width of a permutation 7r in G is not equal in 
general to the width of a reversal of n in G~ 1 , the digraph obtained from G 
by reversing all of its edges. We use separation chains m to give a symmetric 
characterization of pathwidth and formulate a variant of Nagamochi’s algorithm 
which branches from each side on at most {h + 2k + 1) vertices. This is how we 
get the running time stated in Theorem [l] We remark that a similar result on 
cutwidth is an immediate corollary of the Nagamochi’s result, since we have the 
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desired symmetry in the definition of cutwidth: the cutwidth of a permutation 
7r in G equals the cutwidth of the reversal of n in G _1 . 

The scenario for the combinatorial result in Theorem [5] is rather straight¬ 
forward. Given an h-semicomplete graph G of pathwidth at least f(h, k), we 
complete it into a semicomplete graph G' on V(G), which must have pathwidth 
at least f(h, k). We then find an obstacle T C V (G) in G' for small pathwidth, of 
one of the types defined in m- Then we consider a random semicomplete sub¬ 
graph G" of G and show that G" inherits an obstacle T' from T with high prob¬ 
ability such that the existence of T' in G" implies pw(G") > k. We need to over¬ 
come, however, some difficulties in carrying out this scenario. To be more specific, 
consider one type of obstacles, namely degree tangles m- An (l, k)- degree tangle 
of G is a vertex set T with \T\ = l such that max„ erd+(v) — min„ e j’ d + (v) < k. 
In order for a degree tangle T in G' to give rise to a degree-tangle T' of the 
random subgraph G", we need the out-degrees of vertices in T' to “shrink” al¬ 
most uniformly. To this end, we wish our sampling to be such that (1) each 
vertex v £ I/(G) is in V(G") with a fixed probability p and (2) for each vertex 
set S C V(G), the intersection S D V(G") has cardinality sharply concentrated 
around its expectation pis'!. The following theorem, which may be of indepen¬ 
dent interest, makes this possible: we apply this theorem to the complement of 
the underlying graph of G with d = h. 

Theorem 4. Let G be an undirected graph on n vertices with maximum degree 
d or smaller. Let p = 2 d+i ■ Then, it is possible to sample a set I of independent 
vertices of G so that Pr(?; £ I) = p for each v £ V(G) and, for each S C V(G), 
we have 


PrdS 1 H I\ > p|S| +t)< exp (-g|£|) 

and 

PrdS D I\ < p\S\ -t)< exp • 

Even with this sampling method, it is still not clear if we can have the desired 
“uniform shrinking” of out-degrees of the vertices in the degree tangle, since if 
the set S of out-neighbors of a vertex has cardinality l?(n), then the deviation 
of |5 n P(G")| from its expectation p|£| is necessarily 17 (y/n). To overcome this 
difficulty, we introduce several types of obstacles that are robust against random 
sampling and show that (1) if G' has an obstacle of a type in Jb7i then it has a 
robust obstacle and (2) each robust obstacle in G' indeed gives rise to a strong 
enough obstacle in G(V") with high probability. 

A conference version of this paper will appear as m ■ The rest of this paper 
is organized as follows. In Section [2] we define some notation. In Section O we 
describe our algorithm and prove Theorem [l] In Section [4] we prove Theorem [2J 
assuming Theorem [4] Finally in Section 0 we prove Theorem |4] 
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2 Notation 


Digraphs in this paper are simple: there are no self-loops and, between each pair 
of distinct vertices, there is at most one edge in each direction. For digraph G, 
V(G) denotes the set of vertices of G and E(G) C V{G) x V(G) the set of edges 
of G. If (u,v) G E(G), then v is an out-neighbor of u and u is an in-neighbor 
of v. For each v G V(G), we denote the set of in-neighbors of v by Nq(v) = 
{it | (u,v) G E(G)} and write IVq[i;] for Nq(v) U {u}. For U C V(G), we define 
Nc\U] = Nq [w] and Nq(U) = Nq[U] \ U. We define the notation for 

out-neighbors N + similarly. In this paper, the in-degree and out-degree of vertex 
v in G, denoted by dg( v) and d£(v), respectively, counts the in-neighbors and 
out-neighbors rather than the incoming and outgoing edges: d@(v) = |.ZVq(i;)| 
and d^(v) = |ATq(i;)|; we also define dg(t/) = \Nq(U)\ and dg(Cf) = \Nq(U)\ 
for U C V(G). We omit the reference to G from the above notation when it is 
clear from the context which digraph is meant. 


3 Algorithm 

In this section, we describe the algorithm claimed in Theorem |T[ prove its cor¬ 
rectness, and analyze its running time. As suggested in the introduction, our 
first task is to give a symmetric characterization of pathwidth to which the 
Nagamochi’s algorithm is adaptable. 

Let G be a digraph. A pair (A, B) of vertex sets of G is a separation of G if 
A U B = V and there is no edge from A \ B to B \ A. The order of separation 
( A,B) is \A fl B\. For S, T C V such that S D T = 0, separation ( A,B) is an 
S-T separation if S fl B = 0 and T fl A = 0. We call an S-T separation ( A , B ) 
trivial if B = V (G) \ S or A = V ( G ) \ T. 

An important role in our algorithm is played by a minimum S-T separation, 
which is defined to be an S-T separation of the smallest order. Note that if a 
minimum S-T separation is trivial, then it must be either (7V + [S'], V(G) \ S) or 
(V(G) \ T, AT _ [T]). As will be seen later, we may use non-trivial minimum S-T 
separations to divide-and-conquer subproblems in our pathwidth computation. 

A sequence of separations ((Aq, Bo), (A±, B±),..., (A r , B r )) is a separation 
chain if Aq C A\ C ... C A r and B r C B r ._ 1 C ... C Bo. The order of this sepa¬ 
ration chain is the maximum order of its member separations. We use operator + 
for concatenating sequences of separations and for appending a separation to a 
sequence of separations: for sequences C and C' of separations and a separation 
(A, B ), C + C' is the concatenation of C and C', ( A , B) + C is the sequence C 
preceded by (A, B ), and C + (A, B) is the sequence C followed by (A, B). 

Let C = ((A 0 , Bo), (Ai, B 2 ),..., (A r , B r )) be a separation chain. We say that 
C is gapless if, for every 0 < i < r, either |Aj\Ai_i| < 1 or \Bi_i\Bi\ < 1 holds. 
Note that this definition allows a repetition of an identical separation. We say 
that C is an S-T chain , if B 0 = V (G) \ S and A r = V (G) \ T, that is, both ends 
of G are trivial S-T separations. Note that every separation in an S-T chain is 
an S-T separation. 
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As observed in EZI, 

(1) if (X 1 ,X 2 , ■ ■ ., X r ) is a path-decomposition of G then ((A 0 , B 0 ), (Ai, B{), 

..., (A r , B r )), where Ai = (Jj<i Xj and B t = Ui<j Xj, is an 0-0 chain in G, and 

(2) if ((Ao,B 0 ), {Ai,B{), ..., (A r , B r )) is an 0-0 chain in G, then (Wi, W 2 ,W r ), 
where Wi = Ai D .Bj_i for 1 < i < r, is a path-decomposition of G. 

These observations lead to the following characterization of pathwidth by 
means of gapless separation chains. 

Lemma 1. Digraph G has a path-decomposition of width k if and only if it has 
a gapless 0-0 chain of order k. 

Proof. Suppose G has a path-decomposition {X\,X 2 ,...,X r ) of width k. We 
may assume that this path-decomposition is nice: A'i = X r = 0 and, for 1 < i < 
r, either X i+ i = A f U {u} for some v G V(G) \ X, or X i+:i = Xj \ {?;} for some 
v € A,;. If we set A* = (J Xj and II, = Uj>i Xj for 0 < i < r as in observation 
(1), then ((A 0 , B 0 ), (A 2 , B 2 ),..., (A r , B r )) is a gapless 0-0 chain. The order of 
this separation chain is inaxo<j< r |Aj D Bi\ = maxi<j< r _i |X,; D X,; + 1 | = k. 
Conversely, suppose a gapless separation chain ((A 0 , Bo), {A\,Bi), ..., (A r , B r )) 
of order k is given. We set Xj = Aj D i?j_i for 1 < * < r. Then, (Xi, X 2 ,..., X, r ) 
is a path-decomposition by observation (2). Since our separation chain is gapless, 
we have either |Aj\Aj_i| < 1 or |I3j_i\Bj| < 1 for 1 < i < r. In the former case, 
we have |A* D 5j-i| < |Aj_i D I?j-i| + 1 = fc + 1 and, in the latter case, we have 
|Aj D Bi- 1 | < |Aj fl Bi\ + 1 = k + 1. Therefore, the width of path-decomposition 
(Xi,X 2 ,... ,X r ) is at most k and hence G has a path-decomposition of width 
k. □ 


We say that a pair (5, T) of vertex sets of G is k-admissible if X + [S'] flT = 0 
(and hence S fl X _ [T] = 0), d + (S) < k, and d~(T) < k. It is clear that ( S,T ) 
must be fc-admissible in order for G to have a gapless S-T chain of order at 
most k. Our algorithm solves the following problem with parameter k: given 
digraph G and a fc-admissible pair (S, T ), compute a gapless S-T chain of order 
at most k if one exists and otherwise report the non-existence. The algorithm 
in Theorem |T] applies this algorithm to ( S,T ) = (0,0) and, if it returns an 0-0 
chain of order k, converts it to a path-decomposition of width at most k, using 
the proof of Lemma [l] 

The following lemma provides the base case for our algorithm. 

Lemma 2. If pair ( S , T) is k-admissible and satisfies \V{G) \(5UT)| < k + 1 
then G has a gapless S-T chain of order at most k. 

Proof. The proof is by induction on \V{G) \ (5 U T)\. The base case is where 
V(G) \ (S U T) = N + (S) = N~(T). The statement holds in this case, since 
the separation (X + [S'],X“[T]) alone forms a gapless S-T chain. Since (S, T) is 
fc-admissible, the order of this separation chain is at most fc. Therefore, the base 
case holds. 

Suppose that either V(G) \ (S U T) / N+{S) or V(G) \ (S U T) ± N~(T). 
Consider the first case: we have some v fL X+fiS 1 ] U T. If we set T' = T U {u}, 
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then as v ^ N + (S ), we have N~(T') C V(G) \(SUTU {i;}) and hence we have 
\N~(T')\ < k. We also have _/V + [S'] flT' = 0 since v £ N + (S). Therefore, (S,T r ) 
is fc-admissible. Moreover, we have | V(G) \ (S U T')\ < | V(G) \ (S U T)| < k+ 1. 
Therefore, we may apply the induction hypothesis to ( S,T ') and have a gapless 
S-T' chain C’ of order at most k. Let ( A , B) be the last separation of C'. Then, 
since A = (V(G)\T') C (V(G)\T) and B D N~[T'] D N~[T], C = C'+ (V(G)\ 
T,N~[T]) is an S-T chain. Since C' is gapless and (V(G) \ T) \ A = {w}, C 
is also gapless. Moreover, since the order of C is at most k and the order of 
(V(G) \T, N~[T]) is at most |1V _ (T , )| < k, the order of C is at most k. The 
second case is similar and symmetric to the first case. □ 

We have two types of recurrences: divide-and-conquer and branching. For the 
recurrence of first type, we need the following lemma. 

Lemma 3. Suppose ( X , Y) is a minimum S-T separation. Then, for each S-T 
separation {A, B ), both(AnX , BUY) and (AuX, BnY) are S-T separations 
and moreover neither of their orders exceed that of ( A,B ). 

Proof Let A ± = A\B, A 2 = An B, A 3 = B\ A, X 1 = X\Y, X 2 = X n Y, 
and X 3 = Y \ X. Then, both {A\, A 2 , A 3 ) and (Xi , X 2 , X 3 ) partition of V(G). 
We have 


(in!)\(BU Y) =A 1 nX 1 and 
(BUY)\{An X) =A 3 UX 3 

and, since there is no edge from A\ to A 3 and no edge from X 3 to X 3 , there is 
no edge from (j4nX)\(BUF) to (BUF)\ {AnX). Therefore, (AnX, BUY ) is 
a separation and, similarly, (AuX, BnY ) is a separation. Since SnB = 0 and 
Sflb = 0, we have S D (B U Y) = 0 and similarly (^4 D X) D T = 0. Therefore, 
(. A fl X, B U Y) is an S-T separation and, similarly, ( A U X , Bn Y) is an S-T 
separation. 

To prove the claim on the orders of these separations, we first claim that 

\AnB\ + \xnY\ = |(Ani)n(BuF)| + |(dui)n(5nF)|. (l) 

To see this, note that A fl B = A 2 is partitioned into A 2 fl X\, A 2 fl X 2 , and 
A 2 n X 3 ; X n Y = X 2 is partitioned into Ai n X 2 , A 2 C\ X 2 , and A 3 fl X 2 . On 
the other hand, (AnJf)n(BU Y) is partitioned into A\ fl X 2 , A 2 fl X 2 , and 
A 2 fl X\ ; (AUl)n(Hn Y) is partitioned into A 3 fl X 2 , A 2 fl X 2 , and A 2 fl X 3 . 
Comparing these lists, we see that both sides of m count the same set of vertices 
with the same multiplicity. Since (X, Y) is a minimum S-T separation, we have 
\XnY\ < |(AUl)n(Bn7)| and hence |(AnX)n(5U V)| < \AnB\ by (HJ); 
similarly we have |(A U X) fl (B fl F)| < \ A fl B\. □ 

The following lemma, which corresponds to the main lemma in m underly¬ 
ing the algorithm for submodular systems, provides the divide-and-conquer type 


recurrence. 


Lemma 4. Suppose G has a gapless S-T chain of order k and let (X, Y) be a 
minimum S-T separation of G. Then G has a gapless S-T chain of order at 
most k of the form C± + (X,Y) + C 2 , where C\ is a gapless S-(Y \ X) chain 
and C 2 is a gapless (X \ Y) -T chain. 

Proof. Let C = ((A 0 , B 0 ), (Ai, Bi), ..., (A r , B r )) be an arbitrary gapless S-T 
chain of order at most k. Recall that B 0 = V(G) \ S and A r = V(G) \ T by 
the definition of S-T chains. Consider the sequence of separations C\ consisting 
of (At n X,B t U Y) for 0 < * < r. Since we have A.;_ 1 fl X C Ai C I X and 
B{ UFC Bi_ 1 U Y for 0 < i < r, C\ is a separation chain. Since X fl T = 0 and 
A r = V ( G)\T , we have A r nX = X'. Therefore, C\ is an S-(Y\X) chain, since 
we have V(G) \ ( B 0 U Y) = S and V(G) \ (. A r n X) = V(G) \X = Y\X. Since C 
is gapless, we have, for each 0 < i < r, either \A t \ Aj_i| < 1 or \Bi_\ \ Bi\ < 1. 
In the former case, we have |(AinX)\(A;_i flX)| < 1 and, in the latter case, we 
have Uh)\ {Bi U Y)\ < 1. Therefore, the separation chain C\ is gapless. 

By Lemma 03 the order of C\ is at most k. We similarly construct a gapless 
(A' \ Y)-T chain C 2 of order at most k. 

Since the last separation of C\ is (A r fl X, B r U Y) = (X,B r U Y) and the 
first separation of C 2 is (A 0 U A', B 0 fl Y) = (A 0 U A', Y), the concatenation C\ + 
(X, Y) + C 2 is a separation chain and is moreover gapless. Since this separation 
chain is of order at most k and is an S-T chain, the lemma holds. □ 

We need some preparations before formulating the branching type recurrence. 
We say that an S-T separation chain C = ((A 0 , B 0 ), {Ai,B \),..., (A r ,B r )) is 
nice if, for every 0 < i < r, we have |A i+ i \ A^| < 1 and \B t \ B i+ 1 | < 1. We say 
C is tight if Aq = X + [S'] and B r = N~[T], 

Lemma 5. If G has a gapless S-T chain of order at most k then it has a tight, 
nice, and gapless S-T chain of order at most k. 

Proof. To each S-T chain C = ((A 0 , B 0 ), (Ai, B\), ..., (A r , B r )), we assign a 
non-negative integer S(C) by 

S(C) = \A 0 \N+[S]\ + \B 0 \N-[T}\ 

+ ^ (max{0, |A i+ i \ A t \ — 1} + max{0, \B, \ B l+1 \ - 1}). 

0<2<r 

Choose a gapless S-T chain C = ((A 0 , B 0 ), ..., (A r , B r )) to minimize 

(5(C) subject to being of order at most k. If (5(C) = 0 then C is tight and nice 
and we are done. For contradiction, suppose <5(C) > 0. We first consider the 
case where there is some vertex v G Aq \ N + [S']. Let C' be obtained from C by 
adding separation (Ao \ {v}, Bq) before C. Then, C' is a gapless S-T chain. The 
order of separation (Ao \ {r;}, Bq) is smaller than that of (Ao, Bo) and hence the 
order of C' is at most k. This contradicts the choice of C since <5(C') = S(C) — 1. 
We have similarly a contradiction if there is some v £ B r \ N~[T}. Suppose 
finally that |Ai + i \ A^| > 2 for some 0 < i < r. Let v and v' be two distinct 
vertices in Ai + \ \ Ai. Now, since C is gapless, this assumption implies that 
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|Bi \ Bi + 1 ] < 1. As neither v nor v' is in A; and hence both are in Bi, it follows 
that |A i+ in_B,: + i| > lAjPlBil + l. Since |(AjU{u})n.Bi| = |Ajn.Bj| + l, the order 
of separation (A* U { v},Bi ) is no greater than that of (A i+ i,f? i+ i) and hence 
is at most k. Therefore the S-T chain C that is obtained from C by placing 
(Ai U {v}, Bi) between (Ai, Bi) and (A i+ \, B i+ i) is gapless and of order at most 
k. We have 

S(C') = 5(C) - max{0, \A i+1 \ Ai\ - 1} - max{0, |Bi \ B i+1 \ - 1} 

+ max{0, |{v}| - 1} + max{0, \B t \ Bi\ — 1} 

+ max{0, \A i+l \ (Ai U {■u})| - 1} + max{0, |Bi \ B i+1 \ - 1} 

= <5(C') - max{0,|A i+1 \ A*| - 1]} + max{0, \A i+1 \ (A t U {u})| - 1}. 

Since \A i+ \ \ Ai\ > |A i+ i \ (Ai U {v})| > 0, it follows that 5(C') < 5(C) — 1, a 
contradiction. We similarly obtain a contradiction from the case | Bi \ Bi + 1 | > 2 
as well. □ 

The following lemma provides our branching type recurrence. 

Lemma 6. Suppose G has a gapless S-T chain of order at most k and sup¬ 
pose that | V(G) \ (S U T)\ > k + 2 holds. Then, there are a gapless S-T 
chain ((Aq,Bq), ..., (A r , B r )) of order at most k and a pair of distinct vertices 
u £ V(G) \ (S U IV - [T]) and v £ V(G) \ (T U _/V + [,S']) such that the following 
holds: 

1. ((Ai, Bi),..., (A r , B r )) is an (S U {u})-T chain, 

2. ((Aq, Bq), ... , (A r -i, B r -i)) is anS-(TU{v}) chain, and 

3. ((Ai, B \),..., (A r _ i, B r _ i)) is an (S U {rt})-(T U {t>}) chain. 

Proof. Suppose G has a gapless S-T chain of order at most k. By Lemma [5J 
G has a gapless S-T chain C = ((A 0 , Bo), (A\, B\),..., (A r , B r )) of order at 
most k that is tight and nice. Since C is tight, we have B r = We also 

have Bq = V(G) \ S from the definition of an S-T chain. Therefore, Bq\ B r = 
V(G) \ (SUN- [T]) and this set contains at least two vertices as we are assuming 
| V(G) \(S U T )| > k + 2. Similarly A r \ Aq = V(G) \ (A f+ [5 I ] U T) has at least 
two vertices. Let i\ denote the smallest i such that 0 < i < r and \ Bi\ = 1 
and ii the largest i such that 0 < * < r and \A i+ i \ A,;| = 1 Since C is nice, 
the choice of i\ and i 2 implies that Bi = B 0 for 0 < i < i\ and Ai = A r for 
i 2 < i < r. Let u be the unique vertex in B il _i \ B ix and v the unique vertex 
in Aj 2+1 \ A,; 2 . We must have i\ < i 2 , since otherwise A ix fl B^ = A r C I (B 0 \ 
{u}) = (V(G) \ T) n (V(G) \(S U {«})) = V(G) \ (S U T U {«}) and hence 
| V(G) \ (S U T)\ < | Aq fl B.jJ + 1 < k + 1, contradicting our assumption. 

Since u ^ B ^ and v ^ A^ C Ai 2 , we must have u ^ v. Let C be the 
separation chain ((A i:1 , B.^),..., (A i2 , B i2 )) Then, (A 0 ,B 0 ) + C + (A r ,B r ) is 
a S-T chain since B 0 = V(G) \ S and A r = V(G) \ T, it is gapless since 
\B 0 \ B n | = 1 and | A r \ A,; 2 | = 1, and it is clearly of degree at most k. Since 
B h = T/(G)\(SuM)andA,; 2 = V(G)\(TU{i>}), (A 0 , B 0 )+C' is an 5-(TU{w}) 
chain and C' + (A r , B r ) is an (S U {u})-T chain. Therefore, the separation chain 
(Aq,Bq) + C' + (A r , B r ) qualifies as the S-T chain claimed in the lemma. □ 
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Given these recurrences and the base case above, our algorithm is straightfor¬ 
ward. Suppose we are given a fc-admissible pair ( S,T ). If |V'(G)\(5UT)| < fc + 1 
holds then we apply Lemma [2] and return the gapless S—T chain it provides. Sup¬ 
pose otherwise. We test if there is a minimum S—T separation that is non-trivial: 
a minimum S-T separation (X, Y) that is not equal to either (X + [5], V(G)\S) 
or ( V(G ) \ T, X _ [T]). If we find one, we apply Lemma H] and recurse on sub¬ 
problems (S, Y \ X) and (X \ Y, T ). If either of the recursive calls returns a 
negative answer, we return a negative answer. Otherwise, we concatenate the 
solutions from the subproblems as prescribed in Lemma [I] and return the result. 
Finally suppose that there is no minimum S-T separation that is non-trivial. 
If (JV + [S], V(G) \ S) is the only minimum S-T separation, then we recurse on 
(SU{u}, T) for every v £ V(G)\ (SUT) such that (5U{?;}, T) is fc-admissible. 
If (V(G) \T, N~[T]) is the only minimum 5-T-separation, then we similarly 
branch from T. If both (N+[S], V(G)\S) and ( V{G)\T , N~[T]) are the min¬ 
imum S-T separations, then we branch from both sides. In either case, if any 
of the recursive call returns a gapless separation chain of order at most k , we 
trivially extend the chain into a gapless S—T separation of order at most k and 
return this chain. Otherwise, that is, if all the recursive calls return negative 
answers, we return a negative answer. 

The correctness of this algorithm is proved by a straightforward induction 
for which the above Lemmas provide the base case and the induction steps. 

We analyze the running time of the algorithm. The following observation 
extends the one in na that the number of vertices of out-degree at most k in a 
semicomplete digraph is at most 2k + 1. 

Proposition 1. Let G be an h-semicomplete digraph and let U C V{G). Then 
the number of vertices v £ V(G) \ U such that d + (U U {w}) < k is at most 
h + 2k + 1 for every k > 0. The similar statement with the out-degree replaced 
by the in-degree also holds. 

Proof. Fix I/, let X C V(G) \ U be arbitrary, and set |X| = b. By the definition 
of /i-semicomplete digraphs, G[X] contains at least b(b — h — l)/2 edges and 
hence the average out-degree of vertices in G[X] is at least (b — h — l)/2. For 
each v £ X, Nq(U U {d}) contains N^ x ^(v) and hence if b > h + 2k + 1 then 
there is at least one v £ X such that \Nq(U U {u})| > k. This proves the first 
statement. The second statement is immediate by symmetry. □ 

Thus, the number of vertices to branch on from each side in the above algo¬ 
rithm is bounded by h + 2k + 1. 

To measure the “size” of the problem instance ( S , T), we introduce the follow¬ 
ing two functions. Let 7(5, T) denote the order of the minimum S—T separation. 
Let /j,(S,T) be defined by 

p(S,T) = 2\V(G) \ (N+[S\ U N~[T])\ + \N+(S)AN~(T)\, 
where XAY is the symmetric difference between X and Y. 
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Lemma 7. Let ( X , Y) be a minimum S-T separation. Then, we have 
p(S, Y\X) + n(X\ Y, T)=n(S,T). 

Proof. Since (X , Y) is a minimum S-T separation, we have N + (X\Y ) = N~{Y\ 
X) = XnY and hence N + [X \ Y] = X and N~ [Y \ X] = Y. We define pairwise 
disjoint vertex sets Co, C\, and C 2 by 

Co = X n Y \ (N+ (S) U N~ (T)) 

Cl = XnYn(N + (S)\N~(T)) 

C 2 = XnYn(N-{T)\N+(S)). 

Then, noting that [X fi Y) \ N~(T) = Co U Ci and that N~(T) \ (X (~l Y) = 
N~(T) \ X since N~(T) (~l (X \ Y) = 0, we have 

H{X \ Y, T) = 21 V(G) \ (N+[X \ Y] U N~[T})\ + \N+(X \ Y)AN~{T)\ 

= 2|V(G)\(IU N~[T])\ + \{X nY)AN~(T)\ 

= 2\V (G) \ (X U N~[T])\ + |C 0 | + |Ci_| + \N~{T) \ X\. 


Similarly, we have 

H(S, Y\T) = 2\V (G) \ ( N + [S} U F)| + \N + (S)A{X nT)| 

= 2\V (G) \ (N+[S] U Y)\ + |G 0 | + |C 2 | + \N+(S)\Y\. 

Moreover, we have 

I V(G) \ (N+[S\ U N~[T])\ = | V(G) \ (Y U N+[S))\ + \V[G) \ {X U N~[T})\ + |G 0 | 
and 

\N+(S)AN~(T)\ = \C\\ + |C 2 | + |A^ + (S') \ V| + \N~(T) \ X\. 

Therefore, we have 

p(S, T) = 2\V{G) \ (^[S 1 ] U N~ [T])| + \N+{S)AN~ {T)\ 

= 2| V(G) \(Y U N+[S})\ + 2|(V(G)\(IU N~[T\)\ 

+2\C 0 \ + IGrl + |C 2 | + | N + (S) \ Y\ + \N~(T) \ X\ 

= p(S, Y\X) + p(X\Y, T) 

as claimed in the lemma. □ 

Lemma 8 . Let (X, Y) be a non-trivial S-T separation: X\Y ^ S and Y\X 7 ^ 

T. Then, we have p(S, Y \ X) > 1 and p(X \ Y, T) > 1. 

Proof. Due to the symmetry it suffices to prove the first inequality. From the 
assumption, there is some vertex v £ (X \ Y) \ S. Since N~ [Y \ X] C Y, we 
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have v N \Y \ X\. If v £ N + (S) then v £ N + (S)AN (Y \ X) and otherwise 
v £ V(G) \ (AT+[5] U N~[Y \ X]). Therefore, in either case, we have 

KS, Y \ X) = 2| V(G) \ (N + [S\ U N~[Y \ X])| 
+\N + (S)AN~(Y\X)\ 

> 1 . 


□ 

Let R(S, T ) denote the number of problem instances recursively considered 
when we solve the instance (S, T), not counting the instances in the base case, 
but counting the instance ( S , T) itself unless it is in the base case. Let p'(S , T) = 
max{0, 2p(S, T) — 1 }. 

Lemma 9. Let G be an h-semicomplete digraph and k a positive integer. Then, 
for each k-admissible pair ( S,T ), we have 

R(S , T) < n'(S, T) ■ (h + 2k + i) 2 (fc-7(s,r» 

Proof. The proof is by induction on the structure of recursive calls. If instance 
(S, T) belongs to the base case | V{G) \ (S U T)\ < k + 1, then R(S,T ) = 0 
by definition and inequality ((2]) trivially holds. Note that if p,(S,T) = 0 then 
V(G) \ (S U T) = N~(S) = N + (T) and (S, T) belongs to the base case. We 
next consider the case where, in processing the instance (S, T), the “divide-and- 
conquer” recurrence is applied and instances ( S,T') and ( S',T ) are recursed on. 
We have a non-trivial minimum separation (X, Y) of ( S , T) such that S' = X\Y 
and T' = Y\ X. By Lemma[7l we have p(S, T) = fi(S , T') + T). Moreover, 

by Lemma[8l we have p(S,T') > 1 and fj,(S',T ) > 1. Therefore, we have 

M , (5,T) = 2/x(5,T)-1 

= (2/x(5,r)-l) + (2/x(5 , ,T)-l) + l 
= ^(S,T') + p'(S',T) + 1. 

Moreover, we have 7 (S,T') > 7 (S,T) since every S-T’ separation is a S-T 
separation and similarly 7 (S',T) > Applying the induction hypothesis 

to the instances ( S,T') and ( S',T ), we have 

R(S, T) = 1 + R(S, T') + R(S', T ) 

< 1 + (//(S', T') + p'(S', T)) ■ & 2 ( fc -')'(ST)) 

< 1 + (fJ?(S,T) - 1) • &2(fc-7(S,T)) 

< p.'(S,T) ■ & 2 (fe- 7 (S,T)) ; 


where b = h + 2k+l, that is, inequality ©■ We next consider the case where the 
branching recurrence is applied. We have three cases to consider: (1) (iV + [S], V(G)\ 
S) and ( V(G)\T , IV - [T]) are the only minimum S-T separators, (2) (iV + [S], V(G)\ 
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S) is the only minimum S-T separator, and (3) (V(G) \T, 7V _ [T]) is the only 
minimum S-T separator. First consider case (1). In this case, for each pair of 
vertices u £ V \ (5 U N~[T ]) and v £ V \ (1V + [S'] U T ) such that the pair 
(5 U {it}, T U {v}) is fc-admissible, the instance (S U {u}, T U {v}) is recursed on. 
By Proposition |TJ the number of such pair is at most b 2 = (h + 2k + l) 2 - For 
each pair of u and v, we have by the induction hypothesis 

R{S U M, T U {u}) < n'(S U M, T U {u}) • & 2 (fc- 7 (Su{u},suM))_ 

Since no (S U {u}-(TU {it}) separation is a minimum S-T separation from the 
assumption of this case, we have 7 (S U {u}, T U {i>}) > 7 (S, T). Moreover, since 
fi(S U {it}, T U {i>}) < ^(S 1 , T) and /i( S , T) > 0, we have p!{S U {it}, T U {u}) < 
f j'{S,T ). Therefore, we have 

R(S, T)< 1 + ^2 R (SU {it}, T U {v}) 

U,V 

< 1 + b 2 ■ (n'(S,T) - 1) • &2(fc-7(S,T)-l) 

< IU,'{S,T) ■ &2(fc-7 (S,'T)) j 


that is, inequality ([2}. Cases (2) and (3) are similar and somewhat simpler. □ 

The time for processing each pair (5, T) excluding the time consumed by 
subsequent recursive calls is dominated by the time for finding minimum S- 
T separation and for deciding if there is a minimum S-T separation that is 
not trivial. This can be done in n°^ time by the repeated use of a standard 
augmenting path algorithm for a minimum S-T cut. Since /i , (0,0) = O(n), we 
have the running time claimed in Theorem [T| 

4 Tame obstacles survive random sampling: proof of 
Theorem [2] 

We prove Theorem [2] in this section. 

Let G be a semicomplete digraph with n vertices. For 0 < d < n, let V< d (G), 
V> d {G), V< d (G), and V> d (G) denote the set of vertices v with cIq(v) < d, 
dQ^v) > d, dg(u) < d, and dg(u) > d, respectively. We omit the reference 
to G and write V< d etc. when G is clear from the context. 

Proposition 2. For every 0 < d < n, we have V+ d C V^ n _ d _ 1 and V^ d C 

v >n-d-1- 

Definition 1. |T7f Let G be a semicomplete digraph and let d > 0, l >0 and 
k > 0 be integers. A (d, l, k)- degree tangle of G is a vertex set T C V> d D V< d+k 
with |T| = l. An (d, l, fc)-matching tangle of G is a pair of vertex sets 
with |Xi| = IT 2 I = l such that: 
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1- T\ C V< d , T 2 C V> d _ ( _ fc+1 , and 

2. there is some bijection (f> : Tf —> T 2 such that ( v , (f>(v)) £ E(G) for every 
v £ T\. 

We will often refer to a (d, l, k)-degree (-matching) tangle as an (l,k)-degree 
(-matching) tangle without specifying d. 

Lemma 10. Let G be a semicomplete digraph on n vertices. Then, for each pair 
d\ and c ?2 of non-negative integers such that di -M 2 < n, we have \V> di fl V > d2 1 < 
n - (di + d 2 ) + 2pw(G). 

Proof. Fix an optimal nice path-decomposition Xq. X\,..., X 2 n of G, where 
n = |V(G)|. We say that vertex v is introduced at i if Xj \ W_i = {i>} and 
forgotten at i if W-i \ Xj = {u}. Let io denote the smallest index i such that 
a vertex in V> d fl V(f d2 is forgotten at i + 1; we let vq denote this forgotten 
vertex. Similarly, let i\ be the largest index i such that a vertex in V(f di fl V^ 2 
is introduced at *; we let v\ denote this vertex. If *o > *1 then V> d fl V> d2 C Xi 0 
and hence | V> di fl V> d J < pw(G) + 1; we are done. So suppose that io < i\- 
Let Y 0 = U/<i 0 Xj and \\ = Xj. Since 7V + [uo] C Yq , by the definition of 

path-decompositions, and d + (uo) > d\, we have |Yo| > di + 1. Similarly, since 
IV - [vi] C Yi and d~(v 1 ) > d 2 we have |Yi| > ^2 + 1. Let Z be the set of vertices 
in V>V D V> d2 that are introduced at some i > io and forgotten at some i! < i\. 
Then, each vertex in ( V> d fl V)f d ) \ Z must be in X io if it is introduced at some 
i < io and in Xj, if it is forgotten at some i > i\. As To U Y\ C V(G) \ Z, we 
have 


|lo U Yi| < n - \V? dl n J + |(F>+ i n V^J \ Z| 
< n -\v^ di nv> d2 \ + \x i0 ux il \. 

We have Y 0 fl Y\ = X io fl Xj, from the definition of a path-decomposition and 
hence | Yo \ + \ Y\ \ < n — \ V> di fl V> d3 \ + \ X io | +1 X tl \. Combining with the bounds 
on |Yo| and |Yi| above, we have 

\V>di n V>d 2 1 — n ~ (^1 + !) — (^2 + 1) + \Xi 0 \ + \X ix \ 

< n — (di + c? 2 ) + \Xj 0 1 + | Xj ,| — 2 

< n - (di + c? 2 ) + 2pw(G). 


□ 


Corollary 1. If G has an ( l,k)-degree tangle then pw(G) > (l — k — l)/2. 

Proof. Let The a (l, k )-degree tangle. Then, ^ 5= V>dHV<d+k - ^>cP^> n - (d +k)— 1 
for some d and hence l < n— (n—k— l) + 2pw(G) = /c + l + 2pw(G) bv Lemma [TUI 
The corollary follows. □ 

Remark 1. The lemma in H2 states that if G has a (5 k + 2, fc)-degree tangle 
then pw(G) > k. The above corollary implies a slightly stronger statement that 
if G has a (3 k + 2, fc)-degree tangle then pw(G) > k. 
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The following lemma generalizes the analysis of on matching tangles in HU- 
We need this generalization when we introduce another obstacle for small path- 
width. 

Lemma 11. Let G be a semicomplete digraph on n vertices and let l, k, d 
positive integers. Suppose G has a set of l pairwise vertex-disjoint directed paths 
from Vfj d to Vjf d+k . Then, pw(G) > min{Z,/c}. 

Proof. Let Q be a set of l pairwise vertex-disjoint directed paths from Vff d to 
Vjf d+ k- We assume pw(G) < k — 1 and show that pw(G) > l. Let Xq, ..., X 2n 
be a nice path-decomposition of G of optimal width (which is k — 1 or smaller). 
Let Ai = U j<i X j and B l = \J d>i Xj for 0 < i < 2 n. Since \Ai\ + \Bi\ = 
n + \Ai fl Bi\ < n + k — 1 holds for 0 < i < 2 n, there is some i such that 
\Ai\ < d + k and \Bf\ <n — d—l. Fix such i. For each v Ai, iV _ [u] C Bi and 
hence d~(v) < n — d — 2. Therefore, we have Vjf (i C Vfj n _ d _ 1 C A t . Similarly, 
for each v qL Bi, 7V + [u] C Ai and hence d + (v) < d + k — 1. Therefore we have 
V> d+k C Bi. Therefore, each path in Q from Vff d to V> d+k is from A t to Bi 
and must have at least one vertex in Ai n Bi since (Ai, Bi) is a separation and 
hence there is no edge from Ai \ Bi to Bi\Ai. As the l paths in Q are pairwise 
vertex-disjoint, we have l < \Ai fl Bi\ < pw(G). □ 

Corollary 2. fl 7| / If a semicomplete digraph G has a (l , k)-matching tangle, 
then pw(G) > min{Z, k + 1}. 

Proof. Let (Ti,Tf) be a (l, fc)-matching tangle and let d be such that Tj C Vjf d 
and T 2 C V> d+k+1 . Apply Lemma ITTl to the set of l vertex-disjoint paths from 
V<d V>d+k+i P rov ided by the matching edges. □ 

We follow the scenario described in the introduction. Given an /i-semicomplete 
digraph G of pathwidth at least f(h,k), we complete it into a semicomplete di¬ 
graph G' on V (G), in which we find a large obstacle, say a degree tangle T. Then, 
we apply Theorem [4] to obtain a random independent set / of the complement 
of the underlying graph of G. We hope that T fl I is a tangle of G[I\ that is 
strong enough to conclude pw(G[/]) > k. For this to happen, we need to have 
the out-degrees |(v) fl I\ of v, for v £ T fl /, to be close to each other. 

As observed in na, the optimal vertex separation sequence lists the vertices 
roughly in the order of increasing out-degrees and therefore each vertex has most 
vertices of smaller degree as its out-neighbors, except for some exceptions. The 
following notion of the wildness of vertices measures how exceptional a vertex 
is. 

Definition 2. For each vertex v £ G, we define the wildness wld(u) of v by 

wld(v) = \V^ d+{v) \N + (v)\. 

Lemma 12. Let G be semicomplete and v an arbitrary vertex of G. Then, for 
each integer w > 0, we have 

\ V <d+(v)-w n N ~( v )\ ^ wld (^) -w- 2pw(G) - 1 


16 


and 


I V >d+(v)+w n N+ ( v )\ > wld(t;) -w- 2pw (G). 

Proof. For the first inequality, first observe that 

\ v <d+(v) n N~(v )| > |F+ +W \ AT+(r;)| - 1 

= wld(u) — 1, 

since each vertex not in N~(v ) must be in N + (v) U {u}. Since 

\^<d.+ {v) \ ^<d+ (v)—w I — \^ r >n—d+(v)—l ^ ^>(1+ (v)—w +ll 

< W + 2pw {G)n 

by Lemma [TO] (or trivially holding when w = 0 and hence Lemma [TO] is not 
applicable), we obtain the first inequality. 

For the second inequality, we have |^< d+( „ )+tl) _ 1 | < l y >n-(d+(„) +l «)l < d+(v)+ 
w + 2pw(G) by Lemma 1 101 and hence 

14^-r n N + (v) I < d + (v) +w + 2pw (G) - \V+ d \ N + (v)\ 

< d + (v) + w + 2pw(G) — \K d+{v) \ N+ (v)\ 

= d + (v) + w + 2pw(G) — wld(-y). 

Therefore, of the d + (y) vertices in N + (v ), at least wld(u) — w — 2pw(G) must 
belong to V^ d+(v)+w . □ 

If the vertices of a degree-tangle T have small wildness, then most of their 
out-neighbors are shared and we may expect that their degrees in the sampled 
subgraph G[I] will be close to each other. We call such a degree-tangle tame. 

Definition 3. We say that an (l,w)-degree tangle T of G is tame (relative to 
the parameters l and w), if wld(u) < 3 1 + w + 2pw(G) for each v € T. 

A degree-tangle is not necessarily tame, but a large number of wild vertices 
in a degree-tangle are themselves an evidence of large pathwidth. We capture 
this fact by another type of obstacles we call spiders. 

Definition 4. Let G be a semicomplete digraph and let d > 0, l > 0, and w > 0 
be integers. A (d,l,w)- spider is a triple (T,L,R), where T is a vertex set with 
\T\ > l, L is a family {L v | v € T} of vertex sets, and R is a family {R v \ v £ T} 
of vertex sets, such that the following holds for each v £ T: 

1. L v C N~(v), 

2. \L V \ > 31, 

3. d + (u) < d for each u £ L v , 
l R v C N+(v), 

5. \R V \>31, and 
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6. d + ( u) > d + w for each u £ R v . 

We will sometimes refer to a (d, l, w)-spider as an ( l, w)-spider, without specify¬ 
ing d. 

Lemma 13. If a semicomplete digraph G has an (l, w)-spider then pw(G) > 
min{l, w}. 

Proof. Let (T, L, R) be a (d,l, iu)-spider of G. Let T' be an arbitrary subset of T 
with \T'\ =1. For each v £ T r , select l v £ L v and r v £ R v so that, for each distinct 
pair u,v £ T', we have {«,, l u , r u } D {v, l v , r v } = 0. Since \L V \ > 3 1 and |i?„| > 3 1 
for each v £ T, such a selection can trivially be done in a greedy manner. We 
have a set of l pairwise vertex-disjoint paths from V< d (G) to Vff d+w+l (G) and 
hence by Lemma [Til we have pw(G) > min{l,u;}. □ 

The following lemma shows that spiders capture what we intend them to 
capture. 

Lemma 14. Suppose G has a (21, w)-degree tangle T. Then, G has either a 
tame (l, w)-degree tangle or an ( l, w)-spider . 

Proof. Let U = {v £T \ wld(v) <3 l + w + 2pw(G)}. If \U\ > l then U contains 
a tame (Z, tu)-degree tangle and we are done. So, suppose otherwise. Let d be 
such that T C V> d D V< d+W - For each v £ T \ U, let L v = V< d D N~(v) and 
R v = V+ d+w fl N + (v). Fix v £ T\U. As wld(u) > 3 1 + w + 2pw (G), we have, 
by Lemma [T2l 


\L V \ > wld(v) — (d + (v) — d) — 2pw(G) — 1 

> wld(u) — w — 2pw(G) — 1 

> 3 1 

and similarly |I?„| > 31. Therefore, the triple ( T\U , L , R) is a (d, l, w)-spider. □ 
We similarly define the tameness of matching tangles. 

Definition 5. We say that a (d, l, w)-matching tangle (TijTh) of G is tame if 

1. wld(u) <3 1 + d + w — d + (v) + 2pw(G) for each v £ T\ and 

2. wld(u) <3 1 + d + (v) — d + 2pw(G) for each v £ T?. 

Lemma 15. Suppose G has a (d, 31, w)-matching tangle (Ti,Tz). Then, G has 
either a tame (d, l, w)-matching tangle or a (d, l, w)-spider. 

Proof. Let I\ = {v £ T\ \ wld(u) > 31 + d + w — d + (v) + 2pw(G)} and I 2 = {v £ 

T 2 I wld(u) > 31 + d + (u) —d + 2pw(G)}. If |/i| < 1 and \I 2 \ < 1 then there is some 

T[ C Ti \ 1 1 and T' 2 , C T 2 \I 2 with \T[\ = \T%\ =1 such that there is a matching 
from T[ to T' 2 by edges of G: (T[,T!f) is a tame (d, l, w)-matching tangle. 
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Suppose otherwise. We first consider the case where |/i| > l. For each v £ I\, 
let L v = V< d+ ^ (~l N~(v) and R v = V> d+W D N + (y). Applying Lemma l~H?l and 
using the assumption wld(u) >3 l + d + w — d + (v) + 2pw(G) , we have 

\L V \ > wld(u) — 2pw(G) — 1 

> 3 1 + d + w — d + (v) 

> 3 1 


and 


|i?„| > wld(u) — (d + w — d + {v)) — 2pw(G) 
> 31. 


Therefore, R) is a (d, l, w)-spider. In the case |A 2 1 > l, we have a (d, l, w)~ 

spider similarly constructed on I 2 . □ 

We also need to define the tameness of spiders. 

Definition 6. Let (T,L,R) be a (d, l, w)-spider. We say that a vertex u £ 
U V £ T L v is tame (relative to the parameters d, l, and w) if wld(rt) < 3 1 + d + 
w — d + (u) + 2pw(G). Similarly, u £ [J^pt * s t ame */wld(u) <3 1 + d + (u) — 
d+ 2pw(G). We let L‘ ame and i?* ame denote the set of tame vertices in L v and 
R v respectively. We say that a (d, l, w)-spider ( T,L,R ) is tame if |L* ame | > 21 
and |i?* ame | > 21 for every v £ T. 

We say that a ( l,w)-spider is tame, if it is a tame (d, l, w)-spider for some 
d. 

Lemma 16. Let G be a semicomplete digraph and suppose that G has an ( l,w )- 
spider, where w > 0. Then, G has a tame (l, w')-spider for some w' > w. 

Proof. Suppose G has a (d, /, w)-spider (T, L, R). We may assume that w is the 
largest possible given l: for every (d',l,v/)- spider of G, we have w' < w. Under 
this assumption, we show that the spider ( T,L,R ) is tame. For contradiction, 
suppose not. We consider the case where there is some v £ T such that |L* ame | < 
2 1; the case where there is some v £ T such that |l?* ame | < 21 is similar. Fix such 
v and let U = L v \ L‘ ame . Since \L V \ > 31 by the definition of a spider, we have 
\U\ > l. Let u be an arbitrary member of U and let w u = d + w — d + {u). Since 
d + (u) < d by the definition of a spider, we have w u > w. 

Since u is not tame, we have wld(rt) > 3/ + d + w — d + (u ) + 2pw(G) = 
3/ + w; u + 2pw(G). Let L' u = V(f d+(u) _ Wn r\N~(u) and R' u = V(f d+{u)+Wn nN+(u). 
We apply Lemma IT2l and have 

\L' U \ > wld(u) — w u — 2pw(G) — 1 

> 31 
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and 


\R' U \> wld(u) — w u — 2pw(G) 

> 31 . 

Since d + ( u ) — w u = d + w — 2^ < d — w and d + ( u ) + w u = d + w holds for 
every u GU, we have L' u C V< d _ w and R' u C V> d+W for every u G U. Therefore, 
(U, L' , i?') is a (d — w,l, 2w)-spider, contradicting the choice of w. □ 

To continue our scenario, we invoke the following result due to Pilipczuk. 

Lemma 17. (fT7), Theorem 32) There exists an algorithm , which given a semi- 
complete digraph G and integers k and l > 5 k, in time 0(k\V (G)| 2 ) outputs one 
of the following: 

— an (l + 2, k)-degree tangle in G; 

— a (k + 1, k)-matching tangle in G; 

— a path decomposition of G of width at most (l + 2k). 

The following lemma, building on this lemma and previous lemmas, shows 
that a semicomplete digraph of large pathwidth has a tame tangle or a spider. 

Lemma 18. Let K be a positive integer and G a semicomplete digraph with 
pw(G) > 128A'. Then, G has at least one of the following: 

1. a tame (46 K, 18A') -degree tangle; 

2. a (6 K, 18K)-spider; 

3. a tame ( 6K,18K)-matching tangle. 

Proof. We apply Lemma [T71 to G with l = 92A' — 2 and k = 18A'. Since G does 
not have a path-decomposition of width l + 2k = 92 K — 2 + 36A' = 128^ — 2, the 
algorithm finds either a (92AT, 18A')-degree tangle of G, or an (18A' + 1,18A')- 
matching tangle of G. In the first case, by Lemma HU G has either a tame 
(46A', 18A')-degree tangle or a (46AT, 18A')-spider, which certainly contains a 
(6.A, 18AT)-spider. In the second case, G has a (181^, 18A')-matching tangle and, 
hence by Lemma fl5l either a tame (6AT, 18A')-matching tangle or a (6AT, 18AT)- 
spider. □ 

Lemma 19. Let h be a positive integer. Then, there is some positive integer kh 
such that the following holds. Let k > kh be an integer and let K = (h + 1 )k. Let 
G be an h-semicomplete digraph and suppose a semicomplete supergraph G' of 
G with vertex set V(G) and with pw(G') < 140A' has a tame (46A', 18I\)-degree 
tangle. Then G has a semicomplete subgraph with a (21k, 10k)-degree tangle. 

Proof. Let T be a tame (46A', 18A')-degree tangle of G'. Let G denote the com¬ 
plement of the undirected graph underlying G. The maximum degree of G is h 
or smaller. We apply Theorem [U to G to obtain a random independent set / of 
G. The probability of each vertex being in I is p = 2 (h+i ) • For each S C V(G), 
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the expectation of |5fl I\ is p\S\ and the probability of deviations is bounded as 
in Theorem 01 

That I is independent in G implies that G[/], which equals G'[I], is semi- 
complete. We show that T fl / contains a (21 k, 10A:)-tangle of H = G[I] with 
high probability. 

We call the event \T D I\ < 21 k the bad event on \T D I\. 

Since E[|T (~1 /|] = 46 pK = 23 k, the probability of this bad event is at most 

Pr (23 k - \T n I\ > 2k) < exp 

= exp ( _ w^i)) 

by Theorem 01 

Let d be such that T C V> d {G r ) fl T4+ J+lgK (G / ). For each v G T fl /, we 
evaluate d~jj(v) as follows. 

d+(v) = \N+ G ,(v)ni\ 

= I VUG') n i\ - |(P<+(GO \ N+G'(v)) ni\ + |(P>Vi(G0 n n+ g .{v)) n 1 1. 

The deviation of the first term is common for all v: A = \V< d (G') fl I\ — 
E[|V0d(GO n/|] = |y+ d (G0 fl/| — p|V<^(G')|- Therefore, we are concerned with 
the deviations of other terms depending on v. 

Let X v = V< d (G')\N + G’( v ) an d Y v = V> d +\{G') D N + G'[y) for each v € T. 

As the (46 K, 18AT)-degree tangle T of G' is tame, we have wld(u) < 3 • 46 K + 

18 K + 2pw(G0 < 436A' and hence 

\X V \ < \ n+ G'{v)\ 

= wld(u) 

< 436 K. 

Since 

\V^ d (G')\=n~\V^d +1 (.G')\ 

>n — (n — d— 1 + 2pw(G0) 

>d+ 1 - 280 K (2) 

by Lemma [TUI and hence 

\VUG')\>d+,(v)-298K+l, (3) 

we have 


\Y v \=d+(v)-(\VUG')\-\X v \) 

< 298K + 436A' 

< 7MK. 
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Call the event \\X V D I\ — p|X„|| > j the bad event on X v and the event 
||Ft, n/| — p\Y v \\ > | the bad event on Y v . By Theorem[4l the probability of the 
bad event on X v is smaller than 

2eXP (-4^Kl) i2eXP ("62784^1)) 
and, similarly, the probability of the bad event on Y v is smaller than 

2CXP 105696(h + 1)) ' 

Therefore, setting say, kh = 10 7 {h + l) 2 , it follows from our assumption k > kh 
that, with probability close to 1, none of the bad events listed above occurs. 

Assume none of those bad events occur. Recall that A = \V+ d {G') (~l I\ — 
p\V< d (G')\. Then, for each v € T fl /, we have 

dM = |R<+(GO n i\ - \x v n i\ + I y v n i\ 

< p|’^<d(G' , )l + A - p\X v \ + - +p\Y v \ + - 

< pd%,{v) + A + | 

and, similarly, 


d+(v) >pd^,{v) + A - 


Therefore, for each v G T fl /, we have 


k k 

pd + A — — < d\j(v) < p(d + 18if) + A + — 


= pd + A + 
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Therefore, T fl I contains a (21 k, 10fc)-degree tangle of H. 


□ 


Lemma 20. Let h be a positive integer. Then, there is some positive integer kh 
such that the following holds. Let k > kh be an integer and let K = (h + 1 )k. Let 
G be an h.-semicomplete digraph and suppose a semicomplete supergraph G' of 
G with vertex set V(G) and with pw(G / ) < 140A" has a (6A", 18K)-spider. Then 
G has a semicomplete subgraph with a ( k,k)-spider. 

Proof. Since G' has a (6A', 18Af)-spider, by Lemma [TGl it has a tame (6A^, w)- 
spider for some w > 18A'. The approach is similar to the proof of Lemma 1191 
The only essential difference is that the wildness of a vertex in the spider may 
not be O(K) and the deviation of its out-degree in the sampled subgraph may 
be large. This is not an essential problem, however, since such a vertex with 
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large wildness has, by the definition of tame spiders, the original out-degree far 
away from the range to be avoided and therefore a large deviation is affordable. 

Let ( T,L,R ) be a tame (d, 6K, u>)-spider of G' , where w > 18 K. As in the 
proof of LemmaflTil let G be the undirected graph underlying G, p = 2 (h+i) > I the 

set of independent vertices of G sampled with probability p applying Theorem [2 
and H = G'[I] = G[I]. Let T' = T fl I and, for each v £ T' , let L' v = L* ame fl I 
and R! v = i?* ame D I. Our goal is to show that ( T' ,L',R ') is a (k,k )~spider of 
H = G'[I] with high probability. 

For this to happen, we need to have \T'\ > k and, for some d! and for each 

v e r, 

1. L' v C N~(v), 

2. \L' V \ > 3 k, 

3. dft(u) < d' for each u € L' v , 

4. R' v c N+(v), 

5- \R' V \ > 3fc, and 

6. dfl(u) > d' + k for each u £ R' v . 

We list “bad” events below that could prevent the above conditions from be¬ 
ing satisfied. We show that the probability of each of those events is exp(—L?(^)) 
and, since the number of those events is obviously O(kh), the probability is close 
to 1 that none of these events occurs under the assumption k > kh if kh is large 
enough. We also confirm that if none of those events occurs then the above 
conditions for ( T',L’,R') being a (k, fc)-spider are all satisfied. 

Since most of the analysis below is similar to the one we did for Lemma. If 91 
we omit some details, using ft notation rather than giving explicit constants in 
probability bounds, and emphasize what is different. 

First consider the event that \T fl I\ < k. Since \T\ > 6 K and hence E[|T fl 
J|] > 3k, the probability of this event is exp(—17(A)). Next consider, for each 
v £ T, the event that \L V fl I\ <3 k or \R V fl I\ < 3k. Since \L V \ > 9K and 
|i?„| > 9 K, the probability of this event is also exp(—L?(^)). If none of these 
events occurs, all conditions enumerated above are satisfied but those on the 
out-degrees on vertices in (J uST , L' u and in Uu<et' 

We proceed to events that may cause intolerable deviations of the out-degrees 
of those vertices. 

For each v £ U u &A L u me U i?* ame ), let X v = F^(G') \ N+(v) and Y v = 
V+(G') D Nq,(v). As in the proof Lemma fl9l we evaluate djj(v) (assuming 
v~£ I), as follows: 


d+(v) = \N+(v)ni\ 

= \vt d (G’)ni\-\x v ni\ + \Y v ni\. 

The deviation of the first term is common for all v: A = \V<AG') fl I\ — 

E[|^< + d (G') n J|] = I VUG 1 ) n J| - p\v£ d (G’)\. 

Therefore, our bad events concern about the deviations of \X V fl I\ and of 
| Y v fl 1 1 from their expectations. 
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First consider v £ (J u6 t L^ me . From the tameness condition and by LemmafTUl 
we have 

|V„| = |K+(G')\JV+(«)| 

s n \ N & Ml 

< d — cLq,(v) + 2pw(G / ) + wld(w) 

< 3(6AT) + 2 (d — cIq,(v)) + w + 4pw(G') 

< 578K + w + 2 (d — d,Q,(y)), 

and using Q, 


\Y v \=d+,(v)~(\V^ d (G')\-\X v \) 

< 298 K + \X V \ 

< 876K + w + 2(d- d£,{v)). 

Note that neither w nor d — c[q,(v) is necessarily 0{K). Our bad events on 
X v and Y v here are that \X V D I\ < p\X v \ — max{y, |(d — d|t,(v))} and that 
| Y v fl/| > p\Y v \ + max{y, |(d — dg;(u))} respectively. 

If ^ > \{d — d~Q,), then, noting that w > 18A' and hence | X v \ = 0{w) and 
\Y V \ = O(w), the probability of each of these events is 



= exp (—17 (p 2 A')) 



The other case is similar and the probability of each of these events is exp(—12(^)) 
in either case. We conclude that, with probability close to 1, none of the above 
bad events occurs. 

We analyze the out-degree of each vertex v £ (J u6 r' ^u assuming that the 
bad event on neither X v nor Y v occurs. We have 


= \vU G ') n J l - nI \ + \ Y * n 7 I 

= P(|y<+ (G')| - \Xy \ + |Y„|) + A + (p\Xy\ - \Xy O I\) + (\Yy O J| - P \Yy |) 
= pd^,{v) + A + {p\X V \ - \Xy 0 I |) + (\Yy n I\ ~p\Yy\). 

The sum of the last two terms is at most 2 max{^, |(d—djt, (u))} = max{^,p(d— 
d~Q,(v)) — + p(^ — djt,(u)). Therefore, we have 

djj(v) < pd + A + ^ (4) 

for each v £ (J ugT , ^4- 
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Next consider a vertex v £ U u gt -R„ ame - From the tameness condition, we 
have 

\x v \ = \vUg') \n+(v)\ 

< wld(i>) 

< 3 • 6K + cIq,(v) — d + 2pw (G') 

< 298 K + d%,{v) - d, 

and using J2]), 

\Y v \ = d+,(v)-(\VUG')\-\X v \) 

< 280K + d+,(v)-d+\X v \ 

< 578A' + 2(d£,(u) - d). 

Our bad events on X v and Y v here are that \X V fl/| > p\X V \ + | (dQ,(v) — d) 
and that |y„n/| < p\Y v \— |(dg, (v) — d) respectively. Since d^., (u) —d > w > 18 K, 
the probability of each of these bad events is exp(—17(-|)) and therefore, with 
probability close to 1, none of these bad events occurs for any v £ U«gt -R^ ame . 

We analyze the out-degree of each vertex v £ Uugt' R'u assuming that none 
of the bad events occurs. We have 

dn(v) = pd^,(v) + A- ( \X V n I\ -p\X v \) - (p\Y v \ - \Y V 0 /|) 

as before and the sum of the last two terms, neglecting signs, is at most 2? (dj, (v)— 
d) = | (dQ,(v) — d). Therefore, we have 

dx(v) > pd~Q, (v) + A- | (d£,(v) - d) 

2 T) 

> pd + A + — (d£, ( v ) - d) 

o 

2v 

> pd + A + -A-w (5) 

o 

for each v £ [J ligT , R'w From £U) and ©, we have 

min{d+(t-) | v £ |J K} ~ max{d+(w) | v £ (J L'J > ^ 

uGT' uGT' 

> 6 pK 
= 3 k. 

Therefore, (T', L' , R') is a (d', k , fc)-spider of H for some d!. □ 

Lemma 21. Let h be a positive integer. Then, there is some positive integer kh 
such that the following holds. Let k > kh be an integer and let K = {h + l)fc. 
Let G be an h-semicomplete digraph and suppose a semicomplete supergraph G' 
of G with vertex set V(G) and pw(G') < 140 K has a tame ( 6K , 18 K)-matching 
tangle (Ti,!^). Suppose moreover that the matching bijection f> of this tangle is 
such that the edge ( v , <f(v )) of G' for each v £ T\ is in fact an edge of G. 

Then G has a semicomplete subgraph that has a (k,k)-matching tangle. 
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Proof. Let G be the complement of the undirected graph underlying G. Let G 
be obtained from G by contracting the doubleton {u, <f>(v)} into a vertex, say t v . 
for each v £ T\. Let T = {t v \ v £ Ti}. Note that the maximum degree of G is 2h 
or smaller. Similarly to Lemma [201 we use Theorem [I] to obtain an independent 
set /' of G where the probability of each v £ V(G) belonging to I' is 2 { 2 h+i) • 

Let H = G[I} where / = ((V’(G') \ T) D X) U {u, (j>{v) \ v £ T\,t v £ I'}. As /' is 
independent in &, I is independent in G and hence G'[I\ = G[I] is semicomplete. 
By an analysis similar to the one in Lemma [001 setting kh large enough, we have 
\T (~l I\ > k and min^gThn/ d~jj(v) ~ max„ 6 Tin i d^(v) > k with probability close 
to 1. When this happens, (Ti fl /,T 2 fl I) contains a (k, /c)-matching tangle of 
H. □ 


We are now ready to prove Theorem O Fix positive integer h. Let kh be a 
constant large enough as required in Lemmas ITOl 1201 and [21] We set f(k,h) = 
128 (h + 1 )k for k > kh and f(k, h) = f(kh, h) for k < kh- 

Let G be an /i-semicomplete digraph of pathwidth at least /(fc, h). In the 
following proof that G contains a semicomplete subgraph of pathwidth at least 
k , we assume k > kh', otherwise we would prove that G contains a semicomplete 
subgraph of pathwidth at least kh > k. We set K = (h+l)k for readability. 

List the vertices of G as V\, ..., v n , in the non-decreasing order of out- 
degrees. Let G' be the semicomplete digraph obtained from G by adding edge 
(Vi,Vj ) for each pair i > j such that neither ( Vi,Vj ) nor ( Vj,Vi ) is an edge of G. 
By our assumption, pw(G') > pw(G) is at least 128AT. We assume below that 
pw(G') < 140A'; if this assumption does not hold, we choose k! > k such that 
128 (h + 1 )k' < pw(G') < 140 {h + 1 )k' and prove that G has a semicomplete 
subgraph of pathwidth > k'. 

Applying Lemma [TH] we obtain a tame (46A, 18AT)-degree tangle, a tame 
(6/\, w)-spider for some w > 18K, or a tame (6/v, 18A')-matching tangle of G'. 

If G' has a tame (46AT, 18A')-degree tangle, then G has a semicomplete sub¬ 
graph that contains (21k, 10fc)-degree tangle, by Lemma flTTl If G' has a tame 
(OAT, ui)-spider for w > 18K, then G has a semicomplete subgraph that contains 
a ( k , fc)-spider, by Lemma EH] Finally, suppose G' has a (6 K, 18AT)-matching 
tangle (Ti,T 2 ) with matching bijection cf. We observe that, for each v £ Ti, the 
edge (v, 4>(v)) of G' is in fact an edge of G, since 

d,Q((f>(v)) > d,Q,((f>(v)) — h > cIq,(v) + 18K — h > d,Q,(v ) 

> d+(v) 

and the edge addition rule for constructing G' from G dictates that if an edge 
between v and <f>(v) is added then it must be from (f>(v) to v. Therefore, Lemma l2l1 
applies and G has a semicomplete subgraph with a (fc, A:)-matching tangle. 

In either case, we conclude that G contains a semicomplete subgraph of 
pathwidth at least k. This completes the proof of Theorem [2] 
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5 Proof of Theorem [4j 


The goal of this section is to prove Theorem |TJ which we restate below. Graphs 
are undirected in this section and we use the following notation. For each v £ 
V(G), N G (v ) is the set of neighbors of v and Ng[v] = Ng(v) U { d }; for each 
U C V(G), N G [U] = U u6t r N a [u] and N a (U) = N G [U} \ U. 

Theorem [4j Let G be an undirected graph on n vertices with maximum degree 
d or smaller. Let p = 2 ^ +1 ■ Then, it is possible to sample a set I of independent 
vertices of G so that Pr(u £ I) = p for each v £ V(G) and, for each S C V(G), 
we have 

Pr^ n I\ > p|S| +t)< exp (^-^| ) 

and 

Pr(|S n I\ < p|S| - t) < exp . 

A naive sampling method is to keep a set V of candidate vertices and repeat¬ 
edly pick a random vertex from V to add to I, removing the selected vertex and 
all of its neighbors from V. This procedure would produce an independent set 
of cardinality at least n/(d+ 1). The exact probability of each vertex being in I, 
however, would depend on the structure of G. To achieve the uniform probabil¬ 
ity as claimed in the above theorem, we sample, at each step, from a d-regular 
supergraph of G[V] rather than from G[V] itself. 

We need the following theorem on regular completion of graphs due to Erdos 
and Kelly. 

Theorem 5. J7] Let G be an undirected graph on n vertices and d an integer 
such that d G (v) < d for every v £ V{G). Let t = Y^vevtG)^ ~ d G (v)). Then, 
there is a d-regular graph on n + m vertices that has G as an induced subgraph 
if and only if m satisfies all of the following four conditions: 

(1) md > t; 

(2) m 2 — m{d + 1) + t > 0; 

(3) m> d — d G (v) for every v £ V(G); and 

(4) (n + m)d is an even integer. 

Akiyama et al. PQ proved that, for every graph G on n vertices with maximal 
degree d or smaller, there is a d-regular graph on N < n + d + 2 vertices (N < 
n + d + 1 if nd is even) that contains G as a (not necessarily induced) subgraph. 
The following lemma states that every integer N > n + d + 1 with Nd even has 
that property. The proof is, naturally, analogous to the one in [T] . 

Lemma 22. Let G be a graph on n vertices with maximum degree d or smaller 
and N an arbitrary integer such that N > n + d +1 and Nd is even. Then, there 
is a d-regular graph on N vertices that contains G as a subgraph. 


27 



Proof. Let H be a maximal graph on V ( G ) with maximum degree d that contains 
all the edges of G. Let D = {v £ V(G) | dn(v) < d}. From the maximally 
of H, D must be a clique of H and hence |D| < d. It trivially follows that 
t = '52 ve v(G)(d ~ ^h{v)) < d 2 . Setting m = N — n>d+l, conditions (1), 

(2) and (3) of Theorem [5] are trivially satisfied. Condition (4) is also satisfied as 
we are assuming Nd is even. Thus, we may apply Theorem [5] to H to have a 
d-regular graph that contains H and hence G as a subgraph. □ 

We now describe the sampling procedure of Theorem 0] Fix a graph G on n 
vertices with maximum degree d or smaller. Let s = \n/(d + 1)]. We construct 
a sequence of pairs ( Ii,Vi ) for 0 < i < s, where 0 = Iq C Ii C ... C I s and 
V(G) = Vo 2 V\ D ... D V s . Our independent set I is I s . 

Fix i, 0 < i < s and suppose we have constructed and V+ We construct 4+i 
and Vi +1 as follows. Let rii = (2s — i)(d+l). Since i < s, we have rii > n + d+1 > 

1141 + d + 1. Moreover, riid is even as d + 1 divides n t . Therefore, Lemma 
applies and there is a d-regular supergraph Hi of G\Vi] on m vertices. We pick 
a vertex v of Hi uniformly at random. If v £ Vi then we set 1 = I; U {t>}; 
otherwise, we set I i+ 1 = p. In either case, we set V^ + i = Vi \ ({n} U Nh^v)). 

Since Hi is a supergraph of G[Fi], this ensures that v is independent, in G, of 
all vertices in Vi+i. By a straightforward induction, Ii is an independent set of 
G, Vi C V{G) \ Ii , and there is no edge of G between Ii and Vi, for 0 < i < s. 

Remark 2. To make Ii and Vi well-defined random variables for 0 < i < s, 
we assume that the d-regular supergraph Hi of G\Vi] used above is uniquely 
determined from V t and rii by some deterministic procedure relying on some 
predefined total order on V(G) for tie-breaking. 

Lemma 23. For each v £ V(G) and 0 < i < s, 

Pr(u £ 1 1 v £ Vi) = -— 

Ui 

Proof. The proof is by induction on s — i. The base case i = s is trivial. For the 
induction step, suppose i < s. Using the induction hypothesis, we have 

Pr(w £ I | v £ V) = Pr(u £ / i+ i | v £Vi) + Pr(u £ V i+ \ \ v £ V)Pr(v £ I \ v£ V i+1 ) 
1 rii — (d + 1) s — i — 1 

Tli TLi 

1 ^ n ,+1 s — i — 1 

Tli Tli Tli -\-1 

S — i 


Corollary 3. For each v £ V(G), we have 


Pr(w £ I) = 


1 

2{d + 1) 
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Therefore, we have, for each vertex set S C V(G), 


E[|5n/|] 


\S\ 

2(d + 1) 


We show that the value |STl/| is sharply concentrated around its expectation, to 
establish Theorem 2) We assume d > 1 in the following analysis: the case d = 0 
is trivial. 

Fix S C V(G). We first consider the case where IS 1 ! > We define a random 
variable Y,, for 0 < i < s by 


Yi = E[|5(~l/| | (I 0 ,V 


where the expectation is conditioned on the partial outcome of the experiment 
up to the construction of Jj and Vi. We have 


n = |Sn/|, 

y ” = E||Sn7 " = 2(ifT) 


and, for 0 < i < s, 


Yi = -E[Y i+1 | {I 0 ,Vo),{h,Vi),...,(IuVi)}, 


where the expectation is conditioned similarly to the above. Therefore, the se¬ 
quence Yq , ..., Y s is a martingale. 

We show that 


W-Y^l < 


3 

2 


( 6 ) 


holds for 0 < i < s. We have 


Yi = l-Sn/il + 
= \snii\ + 


Pr(u e / | v e Vi) 

vesnVt 

|5n foi|(s - i) 


rii 
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Since both | S' D | and the fraction (s — z)/n* are monotone non-increasing in i 
and |S (~l /j| — |S fl Ii~i\ < 1, we have Y t — Yi_i < 1. We also have 


Yj—i -Yi< 


IS 1 n Fj-rKs - (i - 1)) \SnVi\(s 


77 - 2—1 Tli 

(|S'nV r i _ 1 |-|5nVi|)(a -(*-!)) 


+|5n v^l 


rii -1 

s - (* - 1) s i 


< 


77 - 2—1 Tli 

(d+!)(«-(*-!)) , \SnVi 


rii -1 




< s- (i~ 1) , |S| 

— 2s — (* — 1 ) n 



i) 


and hence ©• 

We use the following form of Azuma’s inequality Let Xq, X\, ..., X m be 
a martingale with 


|A' l+ i — Xt\ < 1 


for all 0 < i < m. Let A > 0 be arbitrary. Then, 

Pr(A' m > X 0 + A y/m) < exp(-A 2 /2) (7) 


and 


Pr(X m < X 0 — \^/m) < exp(—A 2 /2) (8) 

Applying this inequality for martingale Y/ = |Y^, 0 < i < s = m, with 
A = -M=, we have 

3 Vs 1 

Pr(Y s > Y 0 + t) = Pr(Y s / > Y' + 

/ 

<ex H"^ 

£ “ p i~W\) 

and, similarly, 

Pr(y - < y„ - o < e Xp (-Tj), 

finishing the case where |S| > 

We turn to the case where |5| < We define a sequence io, ii, ..., i m 
of indices, where m = 31, that depends on the outcome of the sampling, 
inductively as follows. 
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1. i 0 = 0. 

2. For j > 0, ij is the smallest i > ij-\ that satisfies either of the following 
conditions: 

(1) i = s; 

( 2 ) Vi n 5 ± Vi,_ x n S; 

(3) i - ij -1 > 2]%y. 

Note that if ij = s for some j, then we have ij> = s for j < j' < m. We also 
note that i m = s, since, in determining ij for 1 < j < to, the second condition 
may apply at most l^l times and the third condition at most 2|S| times, but at 
most 21S'! — 1 times if the second condition applies at all. 

We define a random variable Zj for 0 < j < m by 

Zj = E[\sm\ I (Zo,Vi 3 ),(Ji,Fi),...,(4,^)], 

where the expectation is conditioned on the partial outcome of the experiment 
up to the construction of and V Xj . We have 

z m = \sni\, 

z »= E “ sn7|1 = 2(ifi)’ 


and, for 0 < j < s, 

Zj = -E[Z j+1 | (J 0 ,Fo),(/i,Fi),...,(4-,^)], 

where the expectation is conditioned similarly to the above. Therefore, the se¬ 
quence Zq, ..., Z m is a martingale. 

We show that 


I Zj Zj~ i| < 1 

holds for 0 < j < to. We have 

Zj = | <S* D Ii j | + 'y ' Pr(n £ I | v £ Vjj 

= |Sn/d + 


v&s nVi^ 


nu 


(9) 


Since both IS 1 (~l W | and the fraction (s — ij)/n.i j are monotone non-increasing 
in j and IS 1 fl | — |5 D < 1 by the second condition in the definition of 
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ij, we have Zj — Zj_ i < 1. We also have 


s ij -1 


£_< ip h- 1 

rii. ' n s 

~ \2\S\ + ) (d+l)s 
^11 
~ 2(d + l)|S| + (d+l)s 
^11 
“ 2(d+l)|S| + 2(d+l)|S| 
1 

“ 2|Sf 


by the third condition in the definition of ij and 


|<5 n Vij^ | - |STl | < d + 1 
by the second condition. Therefore, we have 

\s n Vil( s ~ i j- 1) I 5 n K/K s ~ 


1 -^3 — 


n b-i n b 

- ISnV^IXs-ij-r) 


+|snw 


5 X j — ]_ S X o 


%-i 


< 


< 


(d + l)(s — ij— i) IS'f'lV^ 


n ij _ 1 2|S’| 

s-ij -1 l-Sn^l 


2s — ij_i 2|5| 


< 1 


and hence ©. 

Applying Azuma’s inequality for this martingale with A = t/y/m , we have 
Pr (Z m > Z 0 + t) < exp 

- exp {~m) 

and 

Pr (Z m < Z 0 — t) < exp 
finishing the proof of Theorem |4] 




32 



















References 


1. J. Akiyama, H. Era, and F. Harary. Regular graphs containing a given graph. 
Elemente der Mathematik, 38: 15-17, 1983. 

2. N. Alon and J. Spencer. The Probabilistic Method. John Wiley & Sons, 1992. 

3. J. Bang-Jensen and G.Z. Gutin. Digraphs: theory, algorithms and applications. 
Springer Science & Business Media, 2008. 

4. H. L. Bodlaender. A linear-time algorithm for finding tree-decompositions of small 
treewidth. SIAM Journal on Computing, Vol. 25, pp. 1305-1317, 1996. 

5. M. Chudnovsky, A. Scot, and P. Seymour. Disjoint paths in tournaments. Advances 
in Mathematics, 270: 582-597, 2015. 

6. M. Chudnovsky and P. Seymour. A well-quasi-order for tournaments. Journal of 
Combinatorial Theory, Series B, 101(1): 47-53, 2011. 

7. P. Erdos and P. Kelly. The minimal regular graph containing a given graph. 
American Mathematical Monthly, 70: 1074-1075, 1963. 

8. S. Fortune, J. Hopcroft, and J. Wyllie. The directed subgraph homeomorphism 
problem. Theoretical Computer Science, 10(2): 111-121, 1980. 

9. F.V. Fomin and M. Pilipczuk. Jungles, bundles, and fixed-parameter tractabil- 
ity. Proceedings of the Twenty-Fourth Annual ACM-SIAM Symposium on Discrete 
Algorithms, 396-413, 2013. 

10. A.O. Fradkin and P.D. Seymour. Tournament pathwidth and topological contain¬ 
ment. Journal of Combinatorial Theory, Series B , 103(3): 374-384, 2013. 

11. A. Fradkin and P. Seymour. Edge-disjoint paths in digraphs with bounded inde¬ 
pendence number. Journal of Combinatorial Theory, Series B, 110: 19-46, 2015. 

12. K. Kistunai, Y. Kobayashi, K. Komuro, H. Tamaki, and T. Tano. Computing 
directed pathwidth in 0(1.89") time. Accepted for publication, Algorithmica, 
2015. 

13. K. Kistunai, Y. Kobayashi, and H. Tamaki. On the pathwidth of almost semicom- 
plete digraphs. Proceedings of the Twenty-Third European Symposium on Algo¬ 
rithms, to appear, 2015. 

14. H. Tamaki. A Polynomial Time Algorithm for Bounded Directed Pathwidth. In 
Proceedings of the 37th International Workshop on Graph-Theoretic Concepts in 
Computer Science, WG2011, pp. 331-342, 2011. 

15. I. Kim and P. Seymour. Tournament minors. Journal of Combinatorial Theory, 
Series B, 112: 138-153, 2015. 

16. H. Nagamochi. Linear layouts in submodular systems. Proceedings of the 23rd 
International Symposium on Algorithms and Computation, 475-484, 2012. 

17. M. Pilipczuk. Computing cutwidth and pathwidth of semi-complete digraphs via 
degree orderings. arXiv preprint arXiv: 1210.5363, 2012. Conference version in Pro¬ 
ceedings of the 30th International Symposium on Theoretical Aspects of Computer 
Science, 197-208, 2013. 

18. N. Robertson and P.D. Seymour. Graph minors. XIII. The disjoint paths problem. 
Journal of Combinatorial Theory, Series B, 63(1): 65-110, 1995. 

19. N. Robertson and P.D. Seymour. Graph minors. XX. Wagner’s conjecture. Journal 
of Combinatorial Theory, Series B, 92(2): 325-357, 2004. 

20. B. Yang and Y. Cao. Digraph searching, directed vertex separation and directed 
pathwidth. Discrete Applied Mathematics, 156(10): 1822-1837, 2008. 


33 


